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Computational ethnomusicology, ethnic music processing: 
an unbiased(non-12Tet) audio signal processing application domain 
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Automatic Music Transcription 


Tempo(speed) 
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Meter: rhythmic parsing D„ra,ions ■> quantization 
A time-frequency representation 


Basic problems: 

Pitch estimation and quantization 
Duration estimation and quantization 


Turkish Music Notation 

Ebced notation: 13th century 
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Turkish Music Notation 


Hamparsum Limonciyan (1768-1839): Hamparsum notation 
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Turkish Music Notation 

Albert Bobowski: a foreigner stayed in Topkapi Palace, uses 
Western notation for the first time(1650) to write Turkish music. 
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Automatic Transcription of Turkish Music 

Extremely complex rhythmic structures: 9/8, 7/8, 13/8,... 176/4 

A 

Ex: 3/2/2/2= 9/8 <!'- 



Modem Notation(20th century) 


MUZiK : SadikAga 



New accidentals since a 
minimum of 17 notes (debates 
continue) in an octave is needed. 

Makam specifies the accidentals 
as the Key does in western music. 



Differences from Western music that cause the need for 

modified/new algorithms 

• Ahenk system: non-standard 


diapozon 


1.5 


Ereedom of the player, use of 
non-standard instruments 


Comparison with ussak template 


Niyazi Sayin 


— makamTemplate 
- actualHistogram 



0 10 20 30 

distance/similarity=0.19118 fO(Hc) 


JV^y TYPE/AHENK 

LENGTH (mm) 

(mm) 

A^=440 cps 

Boldhenk ‘ 

1014-1040 

39-40 

?ieva 

Dariid 

925-936 

35 . 5-36 

caraak 

^a/i 

858-884 

33-34 

seaak - buse/ik 


780-806 

30-31 

dkaak 

Kiz 

702-715 

27-27.5 

last 

Ytldiz “ 

650-663 

25 - 25.5 

arak 

MdstaZ/se// 

598-611 

23 - 23.5 

acem n'^irnii 

S/ipiirde 

572-585 

22 - 22.5 

^7siniu 

Bo!d/ienk N/sf/i/e “ 

520-533 

20 - 20.5 

ueoak 


1.5 


Comparison with ussak template 


1 - 


0.5 - 


Necdet Ya§ar 


makamTemplate 

actualHistogram 



0 10 20 30 

distance/similarity=0.48945 fO(Hc) 
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Differences from Western music that cause the need for 

modified/new algorithms 


More notes ... It is more close to micro-tonal 
music 

Theory is mostly descriptive, it is an oral tradition 
- Mismatch of theory and practice 



- Highly complex makam/mode-key and usul/rhythm structures 


ussak(11 files) 



Differences from Western music that cause the need for 

modified/new algorithms 

• Very frequent use of ornamentations 
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Comparing the two Tuning Systems 




Western 


Turkish 


OK ...there is also an important simplicity: 

Monophonic/heterophonic -> half of automatic transcription 
literature is on multiple fO estimation of polyphonic audio 


Automatic Transcription of Turkish Music 


sound 



Rhythm analysis is excluded for now; our database consists of free 
rhythm solos: taksims. 
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FO detection: 

YIN: de Cheveigne, A. & Kawahara, H. (2002). YIN, a fundamental frequency estimator 
for speech and music. Journal of the Acoustical Society of America, 111 (4), 1917-1930. 


Thanks to authors, we use their 
shared Matlab implementation 


Additional filters: octave 
correction, filtering out 
estimations at low energy regions, 
continuity criteria. 



Checking the quality of fO detection 
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Histogram construction 


k=\ 

W, =l,/„^/o[^]</„-n 

= 0 , otherwise 


53Tet: Logarithmic division 
of an octave into 53 equally 
spaced intervals. 



1/9 Holder comma resolution 



Resolution of the highest resolution tuning system proposed for 
Turkish music: 159tet (Yarman, 2008) 


A simple idea to perform tonic detection, makam recognition and 

tuning analysis 

Histogram alignment, matching and template construction via cross¬ 
correlation 



-100 -80 -60 -40 -20 0 20 40 60 

karat sesine gore He cinsinden uzaklik 
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Various distance measures tested for alignment: 


Lj-norm: 

r/[n] 

= + k\ 

^ k=^ 

L2-norm: 


\= ^ ky 

V k=0 

Intersection: 

r/[n] 

\= + k} 

^ k=0 

K-1 

Bhattacharyya 

distance: 


\=-logjXylK[khth + k\ 

k=0 

Cross-correlation: 

£/[«] 

+ 

II 



•Bozkurt, B., 2008, "An automatic pitch analysis method for Turkish maqam music," Journal of New Music 
Research. 

•B. Bozkurt, et al, 2009, "Weighing Diverse Theoretical Models On Turkish Maqam Music Against Pitch 
Measurements: A Comparison Of Peaks Automatically Derived From Frequency Histograms With Proposed Scale 
Tones", Accepted for publication in Journal of New Music Research. 

•A. C. Gedik, B. Bozkurt, 2009, "Pitch Frequency Histogram Based Music Information Retrieval for Turkish Music 
", Accepted for publication in Signal Processing. 
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The AEU system scale intervals: [0 5 17 22 31 35 44 53] He 



Given a recordings pitch 
distribution, align the two 
distributions 



Examples 


\vols.2_3disc1\02_cecenl<izihuseyni_tanbur.wav 


\vols.2 3disc1\05 rasttaksim tanbur.wav 



-20 -10 0 10 20 30 40 50 60 70 

huseyni Template Matched, Intervals In Holdrian Commas 

\vols.2 3disc3\D2 saba.wav 




0.035 - 


0.03 ■ 


o 0.025 


0.02 ■ 


S 0.015 - 


0.01 


0.005 - 


\vols.2_3disc2\04_tahirbuselikpesrev_k.wav 

I I I I I-1-1 I-1“ 


^ ll l' 


f, !'i -'! 



-20 -10 0 10 20 30 40 50 60 

tahirbuselik Template Matched, Intervals in Holdrian Commas 
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Errors and their contribution in the histogram template 


\vols.2 3disc5\14 ussak k.wav 
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Testing the Tonic Detection on Synthetic Signals (from MIDI) 


RAST (24 songs) 

CrossCorr. 

City Block 

Euclidean 

Intersection 

Bhattacharyya 

Mean Tonic FO(Hz) 

110.05 

109.59 

109.51 

109.59 

109.61 

Std (Hz) 

0.42 

0.16 

0.19 

0.16 

0.17 

MaxDist(Hz) 

1.12 

0.26 

0.34 

0.26 

0.27 

SEGAH (22 songs) 






Mean Tonic FO(Hz) 

276.62 

274.71 

274.44 

274.71 

274.16 

Std (Hz) 

0.48 

0.6 

0.5 

0.6 

0.43 

MaxDist(Hz) 

1.18 

1.27 

0.95 

1.27 

1.23 

Overall (150 songs, 9 
makams) 






Mean-Std(Cents) 

56.39 

2.75 

54.62 

2.75 

2.42 

Max-MaxDist(cents) 

579.9 

7.96 

583.01 

7.96 

7.72 

# false tonic peak det. 

4 

0 

4 

0 

0 


Tests on 118 real recordings: 

# false detections 1111 2 


Makam recognition via simple template matching 


The evaluation results of the makam recognition system. 


Makam type 

TP 

IN 

FP 

FN 

R 

P 

F-measure 

liicaz 

14 

150 

2 

6 

70 

88 

78 

rast 

14 

151 

2 

5 

73 

88 

79 

segafi 

17 

149 

3 

3 

85 

85 

85 

kiirdili fh 

10 

145 

11 

6 

63 

48 

55 

liuzzam 

10 

152 

6 

4 

71 

63 

67 

nihavend 

14 

143 

11 

4 

78 

56 

65 

liiiseyni 

10 

146 

6 

10 

50 

63 

56 

Lt^^ak 

15 

138 

10 

9 

63 

60 

62 

saba 

16 

150 

1 

5 

76 

94 

84 

Mean 

13 

147 

6 

6 

68 

68 

68 


Distance Mesaure: City-Block(L-1 norm), 


Test method: Leave one out 
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Makam recognition via simple template matching 

Confusion matrix 



hicaz 

rast 

segah 

kUrdili 

hicazkar 

hiizzam 

nihavend 

hiiseyni 

u§§ak 

saba 

hicaz 

- 

- 

- 

1 

- 

2 

2 

1 

- 

rast 

- 

- 

- 

- 

3 

1 

- 

1 

- 

segah 

- 

- 

- 

- 

3 

- 

- 

- 

- 

kiirdili 

hicazkar 

1 

- 

- 

- 

- 

2 

- 

3 

- 

hiizzam 

1 

- 

- 

1 

- 

- 

- 

- 

- 

nihavend 

- 

1 

1 

1 

- 

- 

2 

- 

- 

hiiseyni 

- 

- 

- 

3 

- 

3 

- 

4 

- 

u§§ak 

- 

- 

- 

4 

- 

2 

2 

- 

1 

saba 

- 

1 

1 

1 

- 

1 

2 

1 

- 


# of examples: Hicaz:20, Rast:19, Saba:21,Segah:20,Kurdili 
Hicazkar:16, Huzzam:14, Nihavend:18, Huseyni:20,U§§ak:24. 


Confused Makams 



Highly open to further improvement 

Human performance: not measured 

Average Turkish listener should be around 1/9: by chance 
Interested Turkish music listener should be around %50 percent 
Professional musicians: close to %100 
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Rhytmic parsing, tempo detection 


Audio signal 


«-i 


SF(n) = 2 H(\X(n,k)\-\X(n-l,k)\) 




H(x)^ 


x-\- 


X 


X{n,k) = FT(x„[k]) 


Spectral Flux 
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Rhytmic parsing, tempo detection 


Spectral Flux 


Autocorrelation 

function 



i 



Rhytmic parsing, tempo detection 

Synthetic audio examples 
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If tempo is varying, we can try something like a spectrogram: 

The Beat Spectrogram 


Synthetic 

example 



Real 

example 



Hard to 
process 
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The Beat Spectrogram 


250 

g 200 


2 

o 

at 


g-150 



50 


Begining and end 


500 1000 1500 2000 2500 

number of frames 



Can be used as a visual tool but hard to design a robust 
algorithm to perform automatic detection 


Last words 

Music technology is not much developed for 
non-western music, there is a lot of room 
for research. 

Our simple histogram template matching has 
applications in: Tuner design, instrument 
standardization, musicology studies on 
style, historical changes, etc. 
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